Corpus: kat-ge_web_2019_300K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1...5) at sentence endings, counted over the first K sentences

K        # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
100              97            99             99            99            99
1000            782           864            889           909           928
10000          6521          8547           8868          9007          9396
100000        45279         88333          96389         97751         98445
1000000      101654        248818         287314        294568        296622
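
A minimal sketch of how a table like the one above could be computed. It is not the tool that produced these figures. It assumes whitespace tokenization, that the counts are of distinct sentence-final N-grams, and that sentences shorter than N words contribute nothing for that N. The name ending_ngram_counts and its parameters are hypothetical.

```python
from typing import Dict, Iterable, List


def ending_ngram_counts(
    sentences: Iterable[str],
    checkpoints: Iterable[int],
    max_n: int = 5,
) -> Dict[int, List[int]]:
    """Map each checkpoint K to the number of distinct sentence-final
    N-grams (N = 1..max_n) seen in the first K sentences."""
    wanted = set(checkpoints)
    # One set of distinct endings per N.
    seen = [set() for _ in range(max_n)]
    result: Dict[int, List[int]] = {}
    for k, sentence in enumerate(sentences, start=1):
        tokens = sentence.split()  # assumption: whitespace tokenization
        for n in range(1, max_n + 1):
            # Assumption: a sentence with fewer than n tokens is skipped for this n.
            if len(tokens) >= n:
                seen[n - 1].add(tuple(tokens[-n:]))
        if k in wanted:
            result[k] = [len(s) for s in seen]
    return result


# Toy input, not taken from the corpus.
sample = [
    "the cat sat on the mat",
    "a dog sat on the mat",
    "birds fly",
]
table = ending_ngram_counts(sample, checkpoints=[1, 2, 3])
for k, row in table.items():
    print(k, *row)
```

Because every sentence contributes at most one ending per N, each count is bounded by K, which matches the shape of the table above.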


Zipf's diagram for sentence endings (Gnuplot diagram)
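
A minimal sketch of the rank-frequency data behind a Zipf diagram of sentence endings. The page does not say which N the diagram covers, so n=1 (sentence-final words) is an assumption here, as is whitespace tokenization. The function name ending_rank_frequency is hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def ending_rank_frequency(
    sentences: Iterable[str], n: int = 1
) -> List[Tuple[int, int]]:
    """Return (rank, frequency) pairs for sentence-final n-grams,
    with rank 1 assigned to the most frequent ending."""
    counts: Counter = Counter()
    for sentence in sentences:
        tokens = sentence.split()  # assumption: whitespace tokenization
        if len(tokens) >= n:
            counts[tuple(tokens[-n:])] += 1
    freqs = sorted(counts.values(), reverse=True)
    return list(enumerate(freqs, start=1))


# Toy input, not taken from the corpus.
pairs = ending_rank_frequency(["a b", "c b", "d e", "f b", "g e", "h i"])
for rank, freq in pairs:
    print(rank, freq)
```

Writing these pairs to a two-column file and plotting them on logarithmic axes gives the usual Zipf view, in which a roughly straight line indicates a power-law relation between rank and frequency.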

19273 msec needed at 2020-06-02 00:19